Suspicion scoring of networked entities based on guilt-by-association, collective inference, and focused data access
Author
Abstract
We describe a guilt-by-association system that can be used to rank networked entities by their suspiciousness. We demonstrate the algorithm on a suite of data sets generated by a terrorist-world simulator developed to support a DoD program. Each data set consists of thousands of entities and some known links between them. The system ranks truly malicious entities highly, even if only relatively few are known to be malicious ex ante. When used as a tool for identifying promising data-gathering opportunities, the system focuses on gathering more information about the most suspicious entities and thereby increases the density of linkage in appropriate parts of the network. We assess performance under conditions of noisy prior knowledge of maliciousness. Although the levels of performance reported here would not support direct action on all data sets, the results do recommend the consideration of network-scoring techniques as a new source of evidence for decision making. For example, the system can operate on networks far larger and more complex than could be processed by a human analyst. This is a follow-up study to a prior paper; although there is a considerable amount of overlap, here we focus on more data sets and improve the evaluation by identifying entities that receive high scores simply as an artifact of the data acquisition process.

Contact:
Sofus A. Macskassy
Stern School of Business
Department of Information, Operations & Management Sciences
New York University
New York, NY 10012-1126, USA
Tel: 1-212-998-0584
Fax: 1-212-995-4228
Email: [email protected]
Similar resources
A Brief Survey of Machine Learning Methods for Classification in Networked Data and an Application to Suspicion Scoring
This paper surveys work from the field of machine learning on the problem of within-network learning and inference. To give motivation and context to the rest of the survey, we start by presenting some (published) applications of within-network inference. After a brief formulation of this problem and a discussion of probabilistic inference in arbitrary networks, we survey machine learning work ...
The Effectiveness of Compassion Focused Therapy (CFT) on Shame and Feeling of Guilt Among Women with Sexual Abuse Experience in Childhood
Aim: Compassion-focused therapy (CFT) is developed for clients who experience high levels of shame and self-criticism. CFT emphasizes the centrality of our affiliative system in reducing threat-based processing by allowing us to feel cared for and able to offer care to both ourselves and others. The aim of the current study was to investigate the effectiveness of CFT on shame and feeling of gui...
P23: The Investigation of the Obsessive-Compulsive Disorder Severity Based on Self-Focused Attention Styles and Sense of Guilt in Students
Several studies suggest that obsessive-compulsive disorder (OCD) is common among college students. Therefore, identification of factors contributing to the symptoms of this disorder is considered one of the most important issues in the field of education. The purpose of this study is to predict the severity of OCD based on self-focused attention styles and sense of guilt in students. Sample gro...
Is Guilt by Association a Bad Thing?
In this paper we study a classification model that mimics guilt by association. We consider a population consisting of known adversaries, covert adversaries, and benign individuals. In our model we associate a suspicion score with each individual. The scores for the covert and benign populations are initially set to 0, while the known adversaries have a fixed score of 1. The scores change dynam...
Publication date: 2005